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Abstract — We consider the communication channel given 
by a fiber optical transmission line. We develop a method to 
perturbatively calculate the information capacity of a non- 
linear channel, given the corresponding evolution equation. 
Using this technique, we compute the decrease of the chan- 
nel capacity to the leading order in the perturbative param- 
eter for fiber optics communication systems. 



I. Introduction 

TH e performance of any communication system is ul- 
timately limited by the signal to noise ratio of the re- 
ceived signal and available bandwidth. This limitation can 
be stated more formally by using the concept of channel 
capacity introduced withing the framework of information 
theory [0. The channel capacity is defined as the maximum 
possible bit rate for error-free transmission in the presence 
of noise. For a linear communication channel with additive 
white Gaussian noise, and a total signal power constraint at 
the input, the capacity is given by the celebrated Shannon 
formula 

C = Wlog(l + ^] (1) 



Pi 



N 



where W is the channel bandwidth, Pq is the average signal 
power, and Pn is the average noise power. 

Current optical fiber systems operate substantially below 
the fundamental limitation, imposed by the Eq. (^. How- 
ever, a considerable improvement in the coding schemes 
for lightwave communications, expected in the near future, 
may result in the development of systems, whose efficiency 
may approach this fundamental limit. 

However, the representation of the channel capacity in 
the standard form (]l|) is unsuitable for applications to the 
actual fiber optics systems. It was obtained based on 
the assumption of linearity of the communication chan- 
nel, while the modern fiber optics systems operate in a 
substantially nonlinear regime. Since the optical transmis- 
sion lines must satisfy very strict requirements for bit-error- 
rate (lO"^'^ to 10~^^), the pulse amplitude should be large 
enough so that is can be effectively detectable. The in- 
crease of the number of wavelength-division multiplexing 
(WDM) channels]^ in the modern fiber optics communi- 
cation systems also leads to a substantial increase of the 
electric field intensity in the fiber. As a consequence, the 
Kerr nonlinearity of the fiber refractive index = no + 7/ 
(where / is the pulse intensity) becomes substantial and 
should be taken into account. 

In the present paper we consider corrections to the chan- 
nel capacity of the optical fiber communication system, 
originating from the nonlinearity of the fiber. The tech- 
nique that we use involves a perturbative computation of 



the relevant mutual information and subsequent optimiza- 
tion. To our knowledge, this method appears to be sub- 
stantially new. 

II. Fiber Optics Communication System as an 
Information Channel 

We consider a typical fiber optics communication sys- 
tem, which consists of a sequence of N fibers each followed 
by an amplifier (see Fig. 1). The amplifiers have to be in- 
troduced in order to compensate for the power loss in the 
fiber. An inevitable consequence of such design, however, is 
the generation of the noise in the system, coming from the 
spontaneous emission in the optical amplifiers. For sim- 
plicity, we will assume that all the fibers and the amplifiers 
of the link are identical. 

The information is encoded in the electric field at the 
"imput" of the system, typically using the light pulses sent 
at different frequencies. The available bandwidth of the 
amplifiers as well as the increase of the fiber absorption 
away from the "transparency window" near the wavelength 
A = 1.55/xm, limits the bandwidth of the fiber optic com- 
munication system. 

The maximum amount of the information, that can be 
transmitted through the communication system per unit 
time, is called the channel capacity C. According to the 
Shannon's basic result Q, this quantity is given by the max- 
imum value of the mutual information per second over all 
possible input distributions: 

C = ms^^pAH[y]-{H[y\x])pJ (2) 

The mutual information 

R = H[y{oj)]-{H[y{uj)\xiLo)])p^ (3) 

is a functional of the "input distribution" [x (lo)], which 
represents the encoding of the information using the elec- 
tric field components at different frequences 

Eia (t) — du! X (w) exp {iojt) (4) 
Jw 

The entropy H [y (cu)] is the measure of the information re- 
ceived at the output of the communication channel. How- 
ever, if the channel is noisy, for any output signal there 
is some uncertainty of what was originally sent. The con- 
ditional entropy H [y {uj)\x (uj)] at the output for a given 
X (lo) represents this uncertainty. 

The entropies H [y (w)] and H [y {uj)\x {lu)] are defined in 
terms of the corresponding distributions p{y) and p{y\x) 
via the standard relation 



H = - Vy{uj)p[y (oj)] log {p[y{uj)] ) 



(5) 
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Fig. 1. The schematical representation of a fiber optics communica- 
tion channel 



where p = Py{y) for the entropy H [y (w)], and p = p{y \ x) 
for the entropy H [y {lo)\x (lo)], and the functional integral 
is defined in the standard way 



(lu) — lim Cm 



n 



M 

m—1 



(6) 



where is a normalization constant. 

For any communication link, the signal power is limited 
by the system hardware. Therefore, the maximum of the 
mutual information in (|l|) should be found under the con- 
straint of the fixed total power Pq at the input: 



Vx (uj) \x (w)| px [x (uj)] 



(7) 



If the propagation in the communication channel is de- 
scribed by a linear equation, then the input-output relation 
for the system is given by 



y (uj) — K {uj) X {uj) + n (w) 



(8) 



where n {uj) is the noise in the channel. In this approxima- 
tion, the problem of finding the maximum of the mutual 
information can be solved exactly, with the corresponding 
input distribution px being Gaussian||l|. If the amplifiers 
compensate exactly for the power losses in the fibers, the 
Channel Capacity is given by the Shannon formula (^. 

As follows from (|l|), the better bit rates can be obtained 
for the higher signal-to- noise ratio Pq/P/v- With this in 
mind, the optics fiber communication systems are designed 
to operate with the pulses of high power. As a result, the 
optics fiber links operate in the regime, in which due to the 
Kerr nonlinearity the refraction index of the fiber strongly 
depends on the local electric field intensity. Therefore, a 
modern fiber optics communication system is, in fact, an 
essentially nonlinear communication channel, and cannot 
be adequately described within the framework of the Shan- 
non's linear theory. 

III. The Model 

The first step in the calculation of the channel capacity is 
to find the "input-output" relation for the communication 
channel. The time evolution of the electric field in the 
fiber E{z, t), where z is the distance along the fiber, can be 
accuarately described in the "envelope approximation" ||^ , 
when 

E{z,t) = A {z,t) exp {i {Pnz - ujot)) + c.c. (9) 

where the function A represents the slowly (compared to 
the light frequency) varying amplitude of the electric field 



in the fiber. The evolution of A {z, t) is described by the 
equation 



OA OA i d'^A 



-A^tj\A\'A 



(10) 



Here the coefficients /3 describe the frequency dependence 
of the wavenumber 

(3{uj) = (3o + Pii^ + Y"^^ + O [uj^] (11) 

where uj is measured from the center of the band ujq. 

The equation (|l^) neglects the effects such as the stimu- 
lated Raman scattering and the stimulated Brillouin scat- 
tering]^, compared to the Kerr nonlinearity of the refrac- 
tion index of the fiber, represented by the term 7 \A\^ A. 

The optical amplifiers incorporated into the communi- 
cation system (see Fig. 1) compensate for the power 
losses in the fiber, but due to spontaneous emission 
each of them will inevitably introduce noise n{t) = 
exp {iujot) J duj n^^ exp {iujt) into the channel. Generally, 
even in a single optical amplifier, the noise distribution at 
any given frequency ujq + uj within the channel bandwidth 
n {uj) is close to a Gaussian; 



Pn [n{uj)] 



exp 



|n(w) 



P! 



N 



(12) 



This is even more so in a system with many independent 
amplifiers, due to the Central Limit Theorem. For simplic- 
ity, the noise spectrum Pj^ can be assumed to be flat. 

If the envelope function just before the amplifier is 
A {t) = A^ exp {iujt) , then immediately after the amplifier 



A. 



exp 



(f^) 



(13) 



where d is the span of a single fiber. 

The equations ([lO|), ( p^ define the evolution of the elec- 
tric field envelope over one "fiber-aplifier" link of the com- 
munication system. The total "input-output" relation will 
then involve solving the corresponding equations for all N 
iterations of the single fiber-amplifier unit. 

IV. The Perturbative Framework 

If one is able to calculate the "output" signal y{uj) in 
terms of the "input" x{u}) and the noise contributions of 



each of the amplifiers n, 



{a} 



y{uj) = $ {x{uj);nj'^ 



a — 1, 
} 



{N} 



(14) 



then the the conditional distribution p{y\x) can be simply 
calculated as follows: 



p{y\x) 



n 



N-l 
a=l 



VnJ'^^ Pn nj'^^ 



Pn y{^) - 4" {x{uj);n^''''^ , . 



{N} 



15) 



where p„ is the distribution function of the noise, produced 
by a single amplifier. The output distribution py {y) can 
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then be directly related to the input distribution px {x) via 
the standard relation 



Py (y) = / T^x{uj) P[y{i^)\x{uj)] Px [x{uj)] 



(16) 



Using Eqns. (|T1),®, one is able to express the mutual 
information in terms of a single distribution . The calcu- 
lation of the channel capacity then reduces to a standard 
problem of finding the maximum of a (nontrivial) func- 
tional. 



The equation (10) is, in fact, the well studied ||3| non- 
linear Shroedinger equation, with the time and distance 
variables interchanged. Only some partial solutions of this 
equation are known, corresponding to solitons|^, Q. How- 
ever, in order to calculate the channel capacity, one needs to 
find the general input-output relation for the communica- 
tion system. This implies solving a set of N essentially non- 
linear equations (^) for arbitrary initial conditions. Even 
knowing some partial solutions, doing such calculation ex- 
actly for an essentially nonlinear system is not possible in 
a closed form. 

In order to make progress, we note the presence of a nat- 
ural perturbation parameter in the problem, namely 7. In 
fact, the fiber equation ( p^ ) is already an approximation, 
derived in the limit, when the change in the effective re- 
fraction index due to pulse propagation, described by the 
nonlinear term 17!^!^^, is small compared to the "un- 
perturbed" value of the index of refraction no- We have 
developed a perturbative technique, when the solution of 
the nonlinear evolution equation, is represented as a power 
series in 7. Solving ([l0| ) separately for each power of 7, 
and using (^3|), for the input-output relation of a single 



fiber-amplifier unit $ 



defined as 
$(») ( ^("-1) 



(17) 



we obtain: 



A( 



X e^q) {—in^d) + rii^ 

where d is the length of a single fiber, 

1 2 



(18) 



(19) 



The procedure for the calculation of the functions ' , de- 
scribed in detail Appendix ^ can be carried to an arbitrary 
order i. 

The further calculation then involves the following steps: 
• Iterating Eq. (|l8|) N times, to obtain the "input- 
output" relation for the whole communication system 
$ 



I \ (1) 



(AT) 



substituting the result into Eqns. (|T^), ( |l^ ) to obtain 
the conditional distribution p(x\y) and the output dis- 
tribution pj, (y) in terms of the input distribution pa;(a;) 
as expansions in powers of 7 



• calculating the entropies iJ \y (uj)\ and H \x {iij)\y {uj)\, 
and the mutual information K 

Following these steps, the calculation of the channel ca- 
pacity becomes a straightforward procedure. In Appendix 
^ we describe it in detail, using a simple nonlinear channel 
y (lo) = X (oj) exp (— </> [x (w)]) -I- n {uj) as an example. 

V. The Fiber Link Channel Capacity 

After a tedious, but straigthforward calculation, we ob- 
tain: 

H [y {uo)] = Ho [y {uj)] - ACi - AHy + O (7^) (20) 



and 



H[y{Lj)\x{Lj)] = Ho[y{cu)\x{u;)]+AC2 + 0{j*pl) 

where Hq [y] and Hq [x\y] are given by the standard expres- 
sions for a linear channel j^]. In the limit of large signal-to- 
noise ratio Pq ^ Pjv we obtain: 



ACi = N'W I ^ 

a 



1 ad, 



AC2 = ^(iV^-l)W^(2^) Q^\ad,^^^ 



a 



(22) 
(23) 



and the functions Qi and Q2 are defined as follows: 

dx 



Qi{u,z) = 

X f {u,z;xi,X2,x) 



/•1/2 


.1/2 . 


/ dxi 


/ dx2 


1-1/2 


1-1/2 Jl 





/•1/2 


,1/2 


nl/2 


Q2{u,z) = 


/ dxi 


/ dx2 


' dx 




J-1/2 J 


-1/2 J 


-1/2 



X f {u,z;xi,X2,x) 
where x = max [1/2, 1/2 + xi + X2], and 

1 — exp (— 7i — iv'^ 



f {u,z;xi,X2,x) = 



where 



V = z {x ~ Xl) {x — X2) 



(24) 



(25) 



(26) 



(27) 



The correction 



AHy = j'W / Vy (c.)4°) [y (co)] (p« [y (cu)]) (28) 



is caused by the deviations of the otput distribution 

Pv = Py 

from the Gaussian form 



- -'"'^J:YpI!'^ (29) 



Py'' [y i^)] ~ exp 



P., + 



(30) 
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where P^j (such that Pq = J dioP^ 
frequency w: 



is the input power at 



exp 



pO 



(31) 



Note, that the correction Aff^ > and equals to zero only 

when the distribution = 0. Therefore, as follows from 
Eqns. ([2C|) , (p8|) , in the second order in nonlinearity 7 the 
mutual information R has the maximum, when p^y^ — 0, 
or, equivalently, when the output distribution is Gaussian 
up to the first order in nonlinearity. 

For a general nonlinear channel, that would correspond 
to the input distribution, being non-Gassian already in the 
first order in 7. The corresponding correction can be ob- 
tained from Eq. (p^, taken only up to the first orded in 
nonlinearity: 



d_ 

Generally, 
pf' {x) (1 



'Dx{lo) p[y{uj)\x{uj)] px[x{uj)] 



= 0(32) 



such an integral would yield Px {x) ~ 
- ^p^x^ (a;)^ , where pi"' is Gaussian, and p^x^ ^ 
0. However, it is straightforward to show, that for the fiber 
optics channel described by Eq. ( |lo| ) , a Gaussian input dis- 
tribution leads to non-Gaussian corrections in the output 
distribution starting only from the second order. There- 
fore, the requrement p^y^ = is satisfied, when the input 
distribution is such that p^x^ — 0. 

For the channel capacity, defined as the maximum value 
of the mutual information, we obtain: 



C 



W\og 1 



Pq_ 

Pn 



ACi - AC2 + O (7^) (33) 



The equation ( pSj ) yields the result for the fiber optics chan- 
nel capacity in the second order in the nonlinearity 7. In 
the next section we will discuss the physical origins of the 
corrections ACi and AC2. 

VI. The Discussion and the Conclusions 

In the spirit of the Shannon formula, the decrease of the 
capacity of a communication channel with a fixed band- 
width can be attributed to (i) the effective suppression of 
the signal power, and (ii) the enhancement of the noise. 
The corrections to the channel capacity, derived in the pre- 
vious section, can be interpreted as resulting precisely from 
these two effects. 

The four- wave scattering!^, induced by the fiber non- 
linearity, inevitably leads to the processes, which gener- 
ate photons with the frequencies outside the channel band- 
width. Such photons, are not recorded by the "receiver", 
and are lost for the purpose of the information transmis- 
sion. This corresponds to an effective bandwidth power 
dissipation, and should therefore lead to a decrease of the 
channel capacity. Since for small nonlinearity this power 
loss AP ~ 7^, the dimension analysis implies 



AP 



fPlpfPW^ 
\ a 



(34) 
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Fig. 2. The corrections to the channel capacity, ACi and AC2, in 
units of W N'^ 'y'^ Pq a~'^ , shown as functions of \(i2\W'^ /a in the 
limit ad ^ 1, S> 1- The correction AC2 is represented by the 
solid line, while ACi corresponds to the dashed line. Note, that 
ACi, which describes the effect of the power leakage from the 
bandwidth, is more strongly affected by the dispersion 



Such scattering processes are suppressed, when the scat- 
tering leads to a subsantial change of the total momen- 
tum 5k [bJi, UJ2 — > W3, w], so that the corresponding scatter- 
ing rate 



5 [uJl + UJ2 — UJ3 — Uj) 

1 -I- (Sk/kq)'^ 



(35) 



In the spirit of the uncertainty realtion, kq ~ 1/Peff, where 
Peff corresponds to the length of the concentration of the 
power of the signal in the fiber. For a small absorption 
coefficient a <C l/d the distance Les is of the order of 
the fiber length d, while in the opposite limit a ^ 1 the 
effective length icff ~ l/a. 

Using Eqn. ([l9|), and the energy conservation u!^ — u!i + 
L01 — OJ, the momentum change 5k, [u>i,uj2 — > ^3,0;] can be 
expressed as 



5k 



(32 (W - UJl) [lO - LOl) 



(36) 



Substituting (|36| ) into (P5|), for the channel capacity loss 
due to the bandwidth power "leakage" , in the limit Pq ^ 
P/v, and ad ;S> 1, we obtain 



AC. 



W- 



.AP 



W 



duji 



dwo. 



/ duj S [llIi,UlI2 ^ t^3,^] 



w Jw 



duJn 



w 



W 



tw 



duJi 



w 



duj2 



w 



duj 



1 -I- {132/oif [UJ - LOlf {lJ - 1^2)^ 



(37) 



which in the appropriate limit is consistent with ACi . 

In Fig. 2 we plot the dependence of ACi on the dimen- 
sionless parameter (32W^ /a. Since momentum change 5k 
is proportional to /32, the increase of the dispersion leads to 
a strong suppression of the power leakage from the band- 
width window, and of the corresponding correction to the 
channel capacity. 
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In a communication system with many "fiber- amplifier" 
units, the fiber nonUnearity leads not only to the mixing 
of the signals at different frequencies, but also to the mix- 
ing of the signal with the noise. Qualitatively, this would 
correspond to an effective enhancement of the noise power 
in the system, and therefore to a loss of the channel capac- 
ity. This effect is not present, when the system has only 
one "fiber-amplifier" link, which explains the appearance 
of the {N -1) factor in AC2 and AC3. 

The effective noise enhancement is caused by the scat- 
tering processes, which involve a "signal photon" and a 
photon, produced due to spontaneous emission in one of 
the amplifiers. The total power of this extra noise can be 
expressed as 



AP, 



N 



2P0PN 



7 



w 



duj-i / duj 



w 



X 5 [lJi,CJ2 W3,^] 



(38) 



The corresponding correction to the capacity 



AC, 



W- 



N 



duJi j di02 / doj 
a' Jw Jw Jw 

^ (39) 



1 + (/?2/a)^ (w - UJl)'^ {UJ - UJ2f 



N 



where we assumed ad ^ I. In this limit ( p9D is up to a 
constant factor identical to AC2. 

The dependence of AC2 on (32W'^ /a is also shown in Fig. 
2. Note, that AC2 also decreases with the increase of the 
dispersion, but more slowly than ACi. Since the scattering 
processes, which contribute to ACi, need to "move" one of 
the frequencies out of the bandwidth window, they gener- 
ally involve a substantial change of the total momentum, 
and are therefore more strongly affected by the dispersion. 

The two physical effects, described above, determine the 
fundamental limit to the bit rate for a fiber optics commu- 
nication system. As follows from our analysis (see Fig. 2), 
the relative contributions of ACi and AC2, often referred 
to as the "four- wave mixing" , can be suppressed by choos- 
ing a fiber with a large dispersion, or when using a larger 
bandwidth. 

In our analysis, we treated the whole available band- 
width as a single channel. As a result, the cross- 
phase modulation^, which severely limits the performance 
of advanced wavelength-division multiplexing systems 
(WDM), does not affect the channel capacity. The rea- 
son for this seemingly contradictory behaviour, is that in a 
WDM system, the "receiver" , tuned to a particular WDM 
channel, has no information on the signals at the other 
channels. Therefore, even in the absense of the "geniune" 
noise, the nonlinear interaction between different channels, 
leading to a change in the signal in any given channel, will 
be an effective noise source, thus limiting the communica- 
tion rate. This limit however is not fundamental, and can 
be overcome by using the whole bandwidth all together. 

In conclusion, we developed a perturbative method for 
the calculation of the channel capacity for fiber optics com- 
munication systems. We obtained analytical expressions 



for the corrections to the Shannon formula due to fiber 
nonlinearity. We have shown that, compared to the Shan- 
non limit, the actual channel capacity is substantially sup- 
pressed by the photon scattering processes, caused by the 
fiber nonlinearity. 

Appendix 

I. Perturbative Solution of the Propagation 
Equation 

In this Appendix we describe the perturbative solutuion 
of the nonlinear equation ( [lO| ) with the boundary condition 



(40) 



A(0,t) — j dw X (uj) exp (iiot) 
We represent A {z, t) as a power series 



A{z,t) — J duj exp (j,LLjt 



+ iKj zj ^7 



ri=0 



(41) 



where is defined is ( |19[ ). Substituting (41) into Eqns. 
(|l^),(|4^), we obtain: 
(i) for £^0 



dTp {z,uj) 
dz 







(42) 
(43) 



(ii) for £ ^ 

dTj {z,uj) 
dz 



£—1 

i ^ (S£_i,^i+^2+^3 / duJi / duj2 

<'l,«2,«3 = l 

X Tti {z, LOi) Ti^ (z, U2) Tl^ [z, ux+loy- t^) 
X exp [i (kj^j + — >^uji^u>i—u> 'Sw)] (44) 
= (45) 



where 6 in the Kronekker's delta- function. 

For any these equations reduce to linear first order 
differential equation, and can be solved straightforwardly. 
For example, the solutions for the first three terms in the 
anzats ( ^l| ) are given by: 

Tp{z,uj) = x(w) (46) 
Tx[z,u:) = J di^i J duj2 F^^^^x {uji) X {UJ2) 

X X* {ui +UJ2-L0) (47) 

X X* (wi) X* ([^2) X {uJi + ijJ2 — oj) X (ws) 

X X (w -I- w - W3) + H^i„2„3^x (wi) X (W2) X (W3) 
X X* {uji uj2~ oj) X* {u; + UJ3 — uj)] (48) 

Here the functions F, G and H are given by 



F'^ 



1 — exp (- 



-ao 



a + iq 



6 



— F'^ ] ( 



X (l-exp(-ad + z0::;^d)) 



pi 



h: 



X (l-cxp(-2arf+(C,.,+C^)d)) 
X - exp {-ad - i<j>'^^J)) 



pi 



X (l-exp(-2ad-*(C,.,+'^:^.3)rf)) 



where 



UJ2 '^UJi-\-iJ2—i^ 



II. The Perturbative Calculation of the 
Channel Capacity 

In this Appendix, we describe the perturbative calcula- 
tion of the capacity of the simple nonlinear channel 



y {uj) — X (w) exp {i^4> i^)]) + {'^) 



(49) 



Here (j) [x\ is an arbitrary real function, and the noise n [lo) 
is a Gaussian random variable: 



Pn [n (io)] 



exp 



N 



(50) 



The fact, that the noise in this model is additive, implies 
that the conditional distribution p{y\x) is fixed, and de- 
fined by the noise distribution p„: 



p{y\x) = p„[y-a;exp(i70(a;))] 



(51) 



It is therefore straightforward to show, that the entropy 
iJ does not depend on 7: 



H[y\, 



dyp,, [y - a;e^^^(-)) 

= -J dzpn [z) log [p„ (z)] 



X lOE 



= log[2^ePAr] 



(52) 



In order to calculate the entropy H [y] , we represent the 
output distribution as a power series in 7: 



Py iv) = pI (y) 



1 + (y) 



(53) 



where Py{y) is the "unperturbed", Gaussian distribution 



p'yiy) = 



1 



N 



■ exp 



12/1 



Po + Pi 



N 



(54) 



corresponding to the linear channel y = x + n. Substituting 
( ^ ) into the definition of the entropy Hy, Eq. we 
obtain: 



log [27re (Po + Pn)] -\ j pf'^ {yfpl {y) 



Po + Pn 



dy\y\ Py{y) 



dy \y\'^p"y (y) 



(55) 



The second term in Eq. (|55|), (1/2) J py^'' (y)^ Py (y), repre- 
sents the difference of the output distribution from Gaus- 
sian, and corresponds to the contribution AHy in Eq. (po|). 
Note, that in the second order in nonlinearity the devia- 
tions of the output distribution from Gaussian lead to a 
decrease of capacity. 

The third term, 

{1/{Po + Pn)) j dy \y\^ Py (y) - j dy \y\^ p'l (y) , is propor- 
tional to the change of the output power, / dy \y\^ Py jy), 
due to nonlinearity, and corresponds to ACi in Eq. (p(]|). 
Generally, the nonlinearity leads to energy exchange be- 
tween different degrees of freedom in the channel (e.g. be- 
tween different frequencies), and to the power leakage out 
of the bandwidth window. However, for the specific (nad 
non-generic) example, chosen in the present Appendix, this 
exchange is absent, since the output power 



{\yf 



(N') + (H') 



Po + Pi 



N 



(56) 



does not depend on the nonlinearity. 

Substituting ( p6| ) in Eq. (55), and using Eq. (p2|), for 
the mutual information R we obtain: 



R = log 



1 



Po_ 

Pn 



l I p'yHyrp'iy) (57) 



As immediately follows from Eq. (|57|), the channel capac- 
ity, equal to the maximum of the mutual information, is 
given by the Shannon formula (Q), and is achieved when 

P«(y) = (58) 

The next step is to calculate the input distribution 



Px (x) 



P°Ax) 



l-f^p(") (x) 



(59) 



corresponding to (p8|). The general relation between the 
input and the output distributions is defined by the condi- 
tional distribution p(y\x): 



Py iy) ^ dx p {y\x) p^ (x) 



(60) 



and, considered as an equation for p{x), is a Fredholm in- 
tergal equation of the first kind. Note however, that since 
Eq. ( |5^ ) represents not the whole output distribution, but 
only it's first order term p]^\y), we can expand Eq. (pO[) 



7 



and keep only the terms up to the first order in 7. We 
obtain: 



dx p{y\x)\ p° {x) (a;) 



+ / dx ■^p{y\x) 



pI {x) = 0, 



(61) 



7=0 



Substituting here the conditional distribution from Eq. 
51]), we obtain: 



dx pn (y - x)p" ix)pi (x) = — 



X J dx pn{y-x)pl{x)(j){x){x*y~y*x), (62) 

Using the identity 

yPn{y~x) = (^ + PN-^^Pn{y~x), (63) 

and integrating by parts, we can represent the right hand 
side of ( |6^ ) as follows: 

i I dx pn{y- x)pI (x) -^(f) {x) {x*y - y*x) = 



Pn 



>{x) d(j){x) 



dx* --"^^ 



Therefore, as follows from Eqns. (|62| ) and (|6J), the input 
distribution 



Pi'^ {x) 



dx 



{x) d(f) {x) 
dx 



(65) 



This procedure can be followed up for all orders in 7. By 
a direct calculation, it is straightforward to show, that the 
channel capacity is represented by the Shannon result (^, 
which is achieved when for any n > 1 



pi"^ (x) = 



(66) 



Subsitituting (|66|) and ( |65[ ) into Eq. ([59D, for the input 
distibution we finally obtain: 



Px (x) 



1 

X exp 



1 , ■ , *-Ax) d(f){x) 
1 + Z7 a; — : x 



dx* 



dx 



\x\_ 



with the corresponding channel capacity 



C = Wlog 



1 



P, 



N 



(67) 



(68) 



This result has a simple physical meaning. When the in- 
put distribution is organized in such a way, that the quan- 
tity z — X exp (ijcj) (x)) has the Gaussian distribution, then. 



considering z as input, the communication channel be- 
comes linear: y — z + n, and the channel capacity is there- 
fore given by the Shannon formula ([l]) , (|68|) . The corre- 
sponding input distribution is then defined by the Jacobian 
of the transformation from x = xn + ixi to z = z^ + izi, 
d{zii,zi) /d{xR,xi) (note, that xr, xj, z^, zj are defined 
as real variables) : 



Px (x) 



1 d {zr, zi) 



exp 
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Vxj\ 



nPo d{xR,xi) 

which reduces to the distribution (|67D, since 
d{zR,zi 



(69) 



d {xR, xi) 



= 1 
= 1 



d(j){xR,xi) d(j){xR,xi) 

IX R 7: 72:/- 



dxi 

. ^d<j){x,x*) . 

T-IX l-fX 



dxR 
dcj) {x,x*) 
dx 



(70) 



This result should be contrasted to the so called "Gaus- 
sian estimate" of the channel capacity]^. In the latter 
appoach, the information channel is described by the joint 
Gaussian distribution 



exp - [xl y*J\ A 



Xu 



where 



A = 



{xlx^) {xly^) 

{y^xu^) {yly^) 



(71) 



(72) 



The channel capacity is then estimated as the mutual in- 
formation, corresponding to the distribution (|7l|): 



Cg 



du) log 



{xZx^){yZyuj) - {xly^){y*^x^ 



(73) 



Under the constraint of the fixed input power J da;(|a:^|^), 
the estimate (^3[) was shown|^ to give the low bound to 
the channel capacity. 

For the model channel considered in the present Ap- 
pendix, the "Gaussian estimate" yields an expression, dif- 
ferent from the Shannon result. For example, when (j) [x) — 
we obtain 

Cg = -Wlog 

= Wlog 



Po 



iPo + PN)il + l'PSf 



1 



3l 

Pn 



2Wj'pS^ + 0{j') (74) 



which, as expected, is smaller that the actual channel ca- 
pacity (|68|). Note, that the difference between the exact 
channel capacity and the Gaussian estimate 



SC = C-Cg^WIou 



Pn 



1 - 



1 



(1 + 7^^0^) 



= 2Wj'P,'^ + 0{j^) 



P 



N 



(75) 
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is not merely a constant scale factor, but a nontrivial func- 
tion of the signal to noise ratio, and the nonlinerity. 

Even when the input distribution is Gaussian, like e.g. 
when the phase (j> depends on x via the "power" \x\^ , the 
Gaussian Estimate does not yield the exact result. The 
reason for this behaviour is that the joint Gaussian distri- 
bution does not correctly reproduce the conditional distri- 
bution p{y \x). 

For an essentially noinlinear system (e.g. a fiber op- 
tics communication channel), there is generally very little 
apriori knowledge about the parametric dependence of the 
Channel Capacity on the signal to noise ratio and other 
system parameters. In this case, the Gaussian Estimate 
for the channel capacity can be (should be?) viewed as a 
very unreliable method, as there is no way to separate it's 
artefacts from the actual behaviour of the channel capacity. 
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